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Abstract 



We study the asymptotics of the p-mapping model of random mappings on [n] as 
n gets large, under a large class of asymptotic regimes for the underlying distribution 
p. We encode these random mappings in random walks which are shown to converge 
to a functional of the exploration process of inhomogeneous random trees, this 
I exploration process being derived (Aldous-Miermont-Pitman 2003) from a bridge 

^ ' with exchangeable increments. Our setting generalizes previous results by allowing 
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1 Introduction 



We study the asymptotic behavior as n ^ oo of random elements of the set [n]^"'^ of 
mappings from [n] = {1,2, ... ,n} to [n]. Given a probabihty measure p = {pi, . . . ,pn) on 
[n], define a random mapping M as follows: for each i G [n], map i to j with probability 
Pj, independently over different i's, so that 

P(M = m) = J] , m G [np. (1) 

iG[n] 

The random mapping M is called the p-mapping. In what follows, we will not be con- 
cerned about keeping track of the labels of the mapping's digraph, so we will suppose 
that the probability p is ranked, i.e. Pi > P2 > ■ ■ • > Pn > 0. 

Now consider a sequence of such probabilities p„ = . . . ,p„„). Weak convergence 
of the associated p-mappings M„ as n ^ oo has been studied when p„ satisfies an 



asymptotic negligibility condition, namely, letting o"(p„) = (X]i<i<nPnj; j 



maXjgM p„ 



, , - 0. (2) 

Under this hypothesis, it has been shown that several features of the p-mapping, such 
as sizes of basins and number of cyclic points, can be described asymptotically in terms 
of certain functionals of reflected Brownian bridge (this was originally proved in for 
the uniform case Pni = ^/n). The two basic ingredients in the methodology of are: 

(i) Code the random mapping into a mapping-walk H'^^'^ that contains enough information 
about the mapping; 

(ii) use a random bijection, called the Joyal correspondence jHj, that maps p-mappings 
into random doubly-rooted trees, called p-trees, whose behavior is better understood. 

In particular, the limits in law of associated encoding random walks can be shown to 
converge to twice normalized Brownian excursion under condition and this informa- 
tion lifts back to mappings, implying that the rescaled mapping walks converge weakly 
to twice standard reflecting Brownian bridge; that is, a{pn)H^^" 2B^^^^ according to a 
certain topology on cadlag functions. Results provable via this methodology encompass 
those proved in [9] by somewhat different methods. 

The goal of this paper is to extend this methodology to more general asymptotic 
regimes for the distribution p, under the natural assumption maxjg[„]p„j — > as n ^ oo. 
In these more general regimes, several p- values are comparable to cr(p„) instead of being 
negligible. Precisely, we will assume there exists = {9i, O2, . . .) such that 

maxp„j — * and ^"^ —>■ 6i , i > 1. (3) 

By Fatou's Lemma, such a limiting must satisfy Yli^i — t)ut may be finite or 

infinite. We let Oq = \/l — Of. A vertex i > 1 with 6i > then corresponds to a "hub" 
or "attracting center" [HI for the mapping, because significantly many more integers 
are likely to be mapped to it as n gets large than to those for which 6i = 0. Our main 
result (Theorem H} roughly states that for p„ satisfying with = {9i, ... ,9j, 0,0, .. .) 
with 9i > and > (the subset of such 0's is called ©finite), we have weak convergence 

a{p^)H''" ^ (4) 
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for a certain continuous process to be described in section IT!?! where the topology is in 
general slightly weaker than the usual Skorokhod topology. We will also provide criteria 
under which the stronger convergence holds. In turn, we will see how this convergence 
and related results give information on the size of the basins of M„, and on the number 
of cyclic points, which in the limit arise as a kind of local time at for . 

To implement our methodology, the key point is that under (jH)), the p-trees are known 
to converge in a certain sense (Proposition to an Inhomogeneous Continuum Random 
Tree (ICRT) which we denote by T^. This family of trees was first investigated in j3| 
in the context of the additive coalescent. What is important for this paper is the recent 
result |2| that a certain class of ICRT's are encoded into random excursion functions H^, 
just as the Brownian tree is encoded into twice the normalized Brownian excursion. The 
definition of is recalled in section 12.31 where we also give the definition of the process 
functional of H . 

So the contribution of this paper is to show how the ideas from [1^ (in particular, 
the Joyal functional featuring in our Lemma may be combined with the result of j2] 
to prove the limit result indicated at (Q. Once these ingredients are assembled, only a 
modest amount of new technicalities (e.g part (ii) of Theorem |21 and its use in the proof 
of Theorem ^ will be required. One reason why "only modest" is our restriction to the 
case ©finite- In |2] it is shown that the construction of and associated limit results for 
p-trees work in the more general setting where < It seems very likely that our 

new result (Theorem also extends to this setting, but the technicalities become more 
complicated. 

While the existence of a limit process Z^ provides qualitative information about as- 
pects of the p-mappings, enabling one to show that various limit distributions exist and 
equal distributions of certain functionals of Z^, obtaining explicit formulas for such dis- 
tributions remains a challenging open problem. 

2 Statement of results 
2.1 Mappings, trees, walks 

We first introduce some notation which is mostly taken from fp. If m is a mapping on 
some finite set S, let V{m) be the directed graph with vertex set S, whose edges are 
s — >■ m(s), and let C(m) be the set of cyclic points, which is further partitioned into 
disjoint cycles, s and s' belonging to the same cycle if one is mapped to the other by some 
iterate of m. For c G C{m), if we remove the edges c m(c) and c' — > c where c' is the 
unique point of S* fl C(m) that is mapped to c, the component of V{m) containing c is a 
tree Tc{m) which we root at c. Label the disjoint cycles of m as Ci(m),C2(m), . . . with 
some ordering convention, then this in turn induces an order on the basins of m: 

c£Cj (m) 

q-biased order. The ordering we will consider in this paper uses a convenient extra 
randomization, yet we mention that results similar to this paper's could be established for 
different choices of basins ordering using similar methods. See e.g. [H], where two different 
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choices of ordering lead to two intricate decompositions of Brownian bridge. Given q, 
a probability distribution on S with qs > for every s E S, consider an i.i.d. q-sample 
(X2, X3, . . .) indexed by {2, 3, . . .}. If m is a random mapping, we choose the q-sample 
independently of m. Since > for every s E S, the following procedure a.s. terminates: 

• Let Ti = 2 and let Bi{m) be the basin of m containing X2. 

• If Ui<i<jBi{m) = S then end the procedure; else, given tj let Tj+i = mf{k : ^ 
Ui<i<jBi{m)} and let Bj+i be the basin containing X,-.^^. 

This induces an order on basins of m, and then on the corresponding cycles. We add 
a further order on the cyclic points themselves by letting cj be the cyclic point of Cj{m) 
such that G , and by ordering the cyclic points within Cj (m) as follows: 

m{cj) ~< m^{cj) -<...-< ml^^("'^l"^(cj) -< Cj. 

This extends to a linear order on C{m) by further letting Cj_i -< m{cj). We call this 
(random) order on cyclic points and basins the (\-hiased random order. In the special case 
where q is the uniform distribution on 5, we call it the size-biased order. 

Coding trees and mappings with marked walks Let T° be the set of plane (or- 
dered) rooted trees with n labeled vertices 1, 2, . . . , n, so that the children of any vertex 
V are distinguished as first, second, . . . The cardinality of T° is therefore n!C„ where C„ 
is the n-th Catalan number. For any T G T°, we may put its set of vertices in a spe- 
cial linear order fi,f2, ■ ■ ■ ,Vn called depth-first order, we let vi = root, and then vj+i is 
the first (oldest) child of vj not in {vi, . . . ,Vj} if any, or the oldest brother of vj not in 
{f 1, . . . ,Vj} if any, or the oldest brother of the parent of Vj not in {vi, . . . ,Vj}, and so on. 
Write ht^(f ) for the height of vertex v. For any weight sequence w = {wi, . . . , Wn) with 
Wi > for every i, let 

j— 1 i 

Hl{s) = h.t^{vi) if ^w^^ <s <^w^^, (5) 

i=i 3=1 

and let H'^(^.Wi) = ht^(i;„). Call the height process of T. Notice that any s G 
[0, '^^Wi) specifies a vertex of T, which is Vi in the case appearing in (jS)). We say that Vi 
is visited at time s by H^. Intuitively, picture a particle touring the vertices in depth-first 
order during the unit time interval, spending time Wi at vertex i. 

Given a mapping m with basins and cyclic points ci,...,ck in q-biased order for 
some q, we may associate to each a walk as follows. First, turn these unordered trees 
into plane trees by putting each set of children of each vertex in random exchangeable 
order, independently over vertices given Then associate to this ordered tree the height 
process H^"^ , with a slight abuse of notation, where we are again given a weight function 
w on [n] (though we use only the relevant labels appearing in 7^.). We can now define 
the walk associated with m to be 

\ i<j<i / j=i j=i 

and Hl^^YlnWi) = H^{J2i'^i~)^ where w{A) = J^ieA'^i- That is, we concatenate the 
tree-walks associated to 7^^ , . . . , 7^^, in this order. Again, there is a natural notion of 
vertex visited at time s < ^ ■ Wi. 
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Further, let D^{i) = J2]=i be the weight of the i first basins, so that D^{i) 

is the time when the mapping- walk has completely visited the vertices of the i-th 
basin, so w{Bi{m)) = D'^{i) — D^{i — 1) for i > 1 with the convention -D^(O) = 0. We 
also let P^{s) be the number of cyclic points that have been visited before time s, namely 

i i—li 

C("S) = ^'^{H^{wavl,...,VJ}))=o} whenever ^w^^ < s < ^w^., 
j=i j=i j=i 

with C(E.«^.)=C(E.^^-)- 



2.2 The Joyal functional 

We now define a functional J" on the Skorokhod space D[0, 1], which translates into the 
world of encoding paths the Joyal bijection (recalled below) between trees and mappings. 
Let u G [0, 1]. Define the pre-post infimum of / G D[0, 1] before and after u to be the 
function 



-s^'^'l I 'mit(.[u,s] ft for s>u. 



The function f{u) is non-decreasing on [0,m] and non- increasing on [u, 1]. If [a, 6] is a 
maximal flat interval for /(m), we call the recentered function ((/ — f{u)){s + a), < s < 
b — a) an excursion of / above f{u). Such a function may not be an excursion in the 
usual sense because it might be zero for some s G (0, 6 — a). Further, if two distinct such 
intervals [a,b] and [c,d\ satisfy = /(c), then it must be that b < u < c, and in this 
case we call the function obtained by concatenating the excursion of / above f{u) on [a, b] 
and [c,d] a (generalized) excursion of / above f{u). Label as El , • • • 

the generalized 

excursions of / above f{u), according to decreasing durations /i, /2, • • •• Write also hi for 
the "height" of the excursion Si, i.e. the value taken by f{u) on the fiat interval of the 
excursion. We define a function J"(/) that arranges these excursions in order of heights: 

r{f){s)=eAs- J2 lA if E ^^•^^< E (6) 

\ j:hj<hi I j:hj<hi j:hj<hi 

with the convention that J"(/)(s) = on !]• 

To keep track of the structure of the original function, we finally add marks at the 
points g^'if) = Ej:h,<h, Ij and c?^(/) = Ej:h,<h, ^i' ^ > 1- In particular, if J"(/) if non-zero 
on ((?"(/), df{f)), then the Ei is an "usual" excursion rather than "generalized" excursion. 



2.3 The Umiting process and main result 

Let us recall the construction 2J of the exploration process of the ICRT for G ©finite- 
Let {bg, < s < 1) be a standard Brownian bridge, Ui, . . . ,Uj be independent uniform 
random variables independent of b, and 

X^'^'^'is) = Bobs + E UMu.<s} -s), 0<s<l. 

i=l 
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Such a process has a.s. a unique time where it attains its overall minimum, and this time 
is a continuity time, call it s^i^. Define the Vervaat transform of X^'^'^ by 

X^{s) = + s^in[mod 1]) - ^^■••^(s^in) , < s < 1, 

and let U = Ui — Smin[niod 1], 1 < i < / be the jump times of . Let T, = inf{s > tj : 
Xf = Xf._} and write 

* [ else. 

Last, let = X^-^f^^ Rf. This process Y^ is continuous, and it is intuitively described 
by: "take away all the jumps of X^ and refiect the process above its infimum after these 
jumps until X^ gets back to the level it started at before jumping". The exploration 
process of is then defined as = ^Y^ . 

The open set {s G (ti, Ti) : H^{s) > H^iti)} associated with jump i can be decomposed 
into disjoint open intervals (tij,Tij),j > 1, ranked by decreasing order of lengths. 

Now take a uniform(0, 1) random variable U independent of Y^, and consider the 
process = 3^{H^). Recall that this process has marks [H^) , d'j^ [H^) , which we 
more simply call gi,di. The following facts are simple consequences of usual properties of 
Brownian motion and Brownian bridge: 

• The sum of durations of generalized excursions of above H^iU) is 1, meaning 

• The corresponding excursion heights hi,i>l are a.s. everywhere dense in [0,H^]. 

Now let Vi,V2,... be independent uniform(0, 1) variables, independent of U and H^. 
Define recursively a sequence = < -Di < -D2 < • • • < 1 by 

Dn = inf{s : s > -D„_i + — -Dn-i) and 3i > 1, s = di} n > 1. 

Last, we define the local time function of as follows: for s in an excursion interval 
of Z^ above 0, let Lf be the "height" of the corresponding generalized excursion of 
above H^. This defines on a dense subset of [0, 1] as an increasing function, which can 
be extended to the whole interval [0, 1] uniquely as a continuous function, because is 
itself continuous, and the excursion heights are dense in [0,H^]. Notice that this "local" 
time has the unusual property that its increase times do not exactly match with the zero 
set of Z^; rather, the set of increase times is the closure of {gi,di,i > 1}. 

Now let p„,q„,w„ be three sequences of probabilities on [n] charging every point. 
Consider a p„-mapping M„ with basins in q„-biased order, and let H^^ be the associated 
walk. We let if*^" := H^". Our main result is 

Pn 

Theorem 1 Suppose maxj ^ as n ^ 00. 

(i) Under the asymptotic regime ^ for p„, with limiting G ©finite? o-nd if 
helow are satisfied, then for any weight function w satisfying maxj Wni — ^ 0, we have the 
convergence in law in the usual Skorokhod topology on ©[0, 1] 
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(ii) Moreover, jointly with the above convergence, the marks D^^ (1) , D^"^ (2) , . . . con- 
verge in law to Di, D2, .... 

(iii) Jointly with the above convergences, (t(p„)£*^" ^ for the uniform topology. 

(iv) In general, under the asymptotic regime ^ forpn, with limiting 6 G ©finite; one 
has convergence in law for the *-topology defined in ^ 



and the convergences of (ii),(iii) hold jointly for w„ = p. 
We echo jH Corollary 1] by stating 



Corollary 1 Under / f3if^lf^) with finite-length limiting 0, and for any weight function 
w„, q„ with maxj max(w„j, g^j) — > as n ^ 00, we have 

(w„(S,(M„)),a(pOCardQ(MO,j > 1) ^ (D, - D,_,,L% - L%^^,j > 1). 

Notice that for uniform w„, the first component equals n~^Card (i3j(M„)). 

The essential point of the *-topology is the following property Ij. One has /" — > / 
for the *-topology, where G D[0, 1] and / G C[0, 1] is continuous, if and only of 
there exist gn,hn G D[0, 1] with fn = gn + hn such that Qn ^ f uniformly, > 
and Leb{a; : /i„(x) > 0} ^ 0. Thus the ^-convergence asserted in (iv) is compatible 
with the possible presence of upward "spikes" on the mapping-walk, which have arbitrary 
large height but vanishing weight. In particular. Theorem ^(iv) allows us to deduce the 
asymptotic "height" (distance to the set C(M„)) of a randomly p„-chosen vertex, but 
not the behavior of the asymptotic maximum height over all vertices, which is however 
handled under the hypotheses in (i). Under the same hypotheses, we can handle quantities 
such as the diameter of the random mapping (the maximal k such that there exists v with 
V, m{v), . . . , m^~^{v) pairwise distinct). 

Although this result leaves a large degree of freedom for choosing the order of basins, we 
stress that other orderings are possible, such as ordering the basins according to increasing 
order of the least vertices they contain, or ordering cycles by order of least vertex they 
contain. The first order is in fact equivalent to the size-biased order described above, up 
to relabeling, and the second order could be also handled by our methods, although the 
marks Di would have to be defined in a different way, see 0. 

Last, we stress that the hypotheses ()8f9|l below are by no means necessary, we believe 
that they are in fact quite crude (see [2j for further discussion). Also, as discussed below, 
we believe that Theorem ^(iv) remains true for much more general w„. 



3 Proofs 



3.1 p-trees and associated walks 

p-trees and their walks. We now define the random trees whose asymptotics are 
related to the process H^, namely p-trees. Let T„ be the set constituted of the n""^ 
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(unordered) rooted labeled trees on [n]. For p a probability measure charging every point 
of [n], let TP be the random variable in T„ with law 

P(TP = t) = n P?^*^ ' ^ e T„, (7) 

where Ci{t) is the number of children of i in t. The fact that ((Zj) indeed defines a probability 
measure amounts to the Cayley multinomial expansion for trees fOj- For t G T„, we can 
associate a random T°-valued tree t° by putting each set of children of a given vertex in 
uniform random order, independently over distinct vertices, so given a weight function w 
on [n] we may associate to t the random walk if* := as defined in section ITTl We will 
now apply this to the random trees and their associated height processes := . 
When w = p we let := H^. 

Asymptotics. We introduce two extra hypothesis on the sequence p„ besides (0). The 
first one prevents exponentially small (in the scale o"(p)) p- values from appearing: 

(minp„i)"^ = o(exp(«/(T(p„))) , Va > 0. (8) 

i 

The second states that "small" p-values are of rough order cr(p)^. Suppose there exists 
some non-negative finite r.v. Q such that, letting p = (0, . . . , 0,]9/+i,p/+2, . . .)) 



lim E 

n— »oo 



exp 



^[exp(AQ)] < oo (9) 



for every A in a neighborhood of 0. Here, ^ denotes a random variable with law p, so 
is its p„-value. 

The key results on p-trees are the following variations of ,2^ Theorems 1,3]. For k > 1, 
let X2, . . . be independent p-sampled vertices of T^, independent of T^. Let rfc(TP) 
be the subtree of spanned by the root and X2, . . . ,Xk, re-interpreted as a tree with 
edge-lengths, in the sense that two vertices separated by a single edge are at distance 1, 
and we delete all the nodes that have degree 2, so the distance between two vertices on 
the final tree is equal to the number of deleted nodes plus 1. The tree VkiT^) is thus 
a discrete rooted tree with at most k leaves, which has no degree 2 vertices, and with 
lengths attached to each of its edges. The notion of convergence on the space of trees with 
edge-length is the usual convergence for the product topology, so t„ — > t if both trees have 
the same discrete structures for all sufficiently large n, and the vector of edge-lengths of 
t„ converges to that of t. Last, for a > we let a ® t be the tree with edge-length with 
same discrete structure as t, and where all distances have been multiplied by a. 

Proposition 1 ([Tj) Suppose that Pn satisfies For every k, the tree cr{pn) ®rj:{T^") 
converges in distribution to , the k-th marginal oj the ICRT described in section^ 

Theorem 2 (i) Suppose that p„ satisfies (1333'? with 6 G ©finite? d'nd that w„ satisfies 
maxjW^j— i>0. Then 

for the usual Skorokhod topology (and hence for the uniform topology since the limit is 
continuous). 
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(ii) Under the assumptions of (i), for each 1 < i < I , there exist random sequences 
tf\Tf" andtf-,Tf-,j>l with 

H^litf") = H^litf;) = HlliTf-) for every j > 1, 

and H^lis) > H^l{tf") for s E [tf",Tf"], such that jointly with the convergence of (i), 
one has convergence in law 

{tf",Tr\tf;,Tr^",l< ^ < I, J > 1) ^ (t„T,,t,,,T,,,l < t < I, J > 1), 

with the notations of section \2.3[ 

(iii) Suppose only that p„ satisfies then 

in the *-topology. Moreover, the statement of (ii) still holds for w„ = p„. 

Proof. Except for the last sentence, point (iii) is a consequence of Proposition 7] 
which states that the convergence of marginals of p-trees to that of the limiting ICRT [21 
Proposition 1 and (23)] is equivalent to the ^-convergence of the rescaled walk (T(p„)ifP" 
with weights w„ = p„ to H^. 

Point (i) was proved in |2| Theorem 3, Corollary 3] in the two special cases where 
w„ = Pn and where w„ = (1/n, . . . , 1/n) (n times). The general case uses the same proof 
as Corollary 3 in the stated paper. By the weak law of large numbers for sampling without 
replacement applied to w„, we have supo<j<i |5'w„,o(^) —t\ ^ in probability, where 5*^^,0 
is the linear interpolation between the points {{J2i<k<i'^nn{k), i/n), 1 < i < n), and where 
TT is a uniformly distributed random permutation on [n] . This implies the result because, 
as shown in [2], the depth- first order on vertices vi,V2, ■ ■ ■ ,Vn of a p„-tree is a (random) 
shift of a uniform permutation of [n]. Therefore, the linear interpolation S^^ between 
points {{J2i<k<i'^nvk^'^/^)A < ^ < ■"■) also uniformly converges to the identity, and the 
conclusion follows from the fact that H^"^ = H^" o {Sp^)~^ o S'w„. 



./S A A.. 
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Figure 1. 
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Point (ii) refines one aspect of (i). First consider tlie case w„ = p„. By Skorokhod's 
representation tlieorem, suppose that tlie convergence in law of (i) holds almost-surely. 
Fix i. Figure 1 shows schematically (top left) three of the excursions of associated 
with jump i. All have the same height, h say. The lower left diagram in Figure 1 shows 
corresponding parts of ifP". Consider the minimum value hn{l) of H^" between Tjj(i) 
and tij(2), and the minimum value hn{2) of H^" between Tjj(2) and tij(3). The key claim 
is 

= /i„(2) for all large n. (10) 

To verify ()10|). take three independent uniform random variables Ui, 1/2,1/3 on [0, 1] in- 
dependent of , H^" ,n > 1. These random variables specify three p„-chosen vertices 
on TP", namely those which are visited by H^" at these times. On an event of positive 
probability we have Uk G itij(^k),Tij(^k)), k = 1,2,3. Consider the subtree of T^" spanned 
by the root and the three vertices encoded by Ui,U2,Us. If /i„(l) 7^ hn{2) then, on the 
above event, the subtree has an edge of length |/i„(2) — hn{l)\ (as shown in rightmost tree 
in Figure 1), but this is not converging to the correct limit asserted in Proposition ^ (in 
the sense of convergence of discrete structures mentioned above Proposition because 
the limit tree (the second-right tree in Figure 1) has different tree shape. Thus we can 
deduce (fTUj) using Proposition [T] It is then straightforward to deduce the full assertion of 
(ii) from the case ()10j) of three excursions. 

Treating the case of general weights w„ is done by asking (again by the Skorokhod 
representation theorem) that the uniform convergence of o S'p,^ to identity is also 
almost-sure. Then replace Uk, = 1, 2, 3 by U^" = S^^ o Sp^{Uk), k = 1,2, 3, so the new 
variables encode again p„-chosen vertices. The case of ^-convergence (for w„ = p„) is 
similar (see also the proof of [H Lemma 2]). □ 
Remark. To prove (iii) for more general weights w„, we could try to use the same method 
as above (first treating the case of uniform weights). But if /„ — / for the *-topology 
with / continuous, and if Sn is a strictly increasing piecewise linear continuous function 
that converges uniformly to the identity on [0, 1], then /„ o Sn need not converge to / 
for the *-topology. Indeed, with the above notation, this convergence is equivalent to 
Leb{x G [0, 1] : S'„(x) > 0} — > 0. But this last quantity is l{/i^>o}(5'~^)'(a;)da;. So 
we would need a sharper result than the weak law of large numbers for sampling without 
replacemement to estimate the values of the derivative at points where hn > 0. However, 
it was proved in [SJ Theorem 25] by different methods that in the asymptotically negligible 
regime (j21). Theorem ^(i) is still valid for general weights w„ satisfying maXjW„j —>■ 0. It 
would therefore be surprising if the same result did not hold here. 

3.2 The Joyal correspondence. 

Let us now describe the Joyal correspondence between trees and mappings, designed to 
push the distribution of p-trees onto the distribution of p-mappings. Let q be a probability 
distribution charging every point. Let Xq be the root of the p-tree and Xi be random 
with law p independent of T^. We consider Xi as a second root, and call the path 
Xq = ci, C2, . . . , ck = Xi from Xq to Xi the spine. Deleting the edges {ci, C2}, {c2, C3}, . . . 
splits TP into subtrees rooted at Ci, C2, . . . , ck, which we call Tii, • • • , %k- Orient the 
edges of these trees by making them point towards the root. Now let X2,X3, ... be an 
i.i.d. q-sample independent of TP. Consider the following procedure. 
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• Let Ti = 2 and ki be such that 7^^.^ contains X2. Bind the trees 7^^, . . . ,Tc^^ by 
adding oriented edges Ci C2 Cfc^ ^ Ci. Let Ci = {ci,...,Cfc^} and 
^1 = Ui<j<fc^7^.. 

• Given r^, ki,Ci, Bi,l < i < j as long as Ui<j<ji3i 7^ [n], let Tj+i = inf{A; : ^ 
Ui<j<j;Bj} and fcj+i be such that X:^.^^ contains X^-.^j^. Then add edges %+i — *■ 

%+2 — > . . . ^ %+l, let Cj+i = {Cfcj+l, . . . , %+i} , ^j+l = Ufc^.+i<i<A:^.^i'7^i. 

When it terminates, say at stage r, the procedure yields a digraph with r connected com- 
ponents Bi, . . . ,Br, and each component contains exactly one cycle of the form Ckj+i —>■ 
Ckj+i — > Ckj+i- Let J(T^,Xi,i > 1) be the mapping whose digraph equals the one 
given by the procedure. Then, as an easy variation of [1, Proposition 1], 

Proposition 2 The random mapping J(T^ , Xi,i >1) is a p-mapping, and the order on 
its basins Bi, B2, . . . ,Br induced by the algorithm is <\-biased order. 



3.3 Consequences for associated walks 

From now on, let be a p-tree, and be its associated height process. Let f 1, t>2, . . . , 
be the vertices of in depth-first order, and let be the linear interpolation between 
points {{Ylii<_j<iWj,i/n),{] < i < n). Given a random variable U uniform on [0, 1] and 
independent of H^, let Xi = Xi{U) be the vertex that is visited by the walk at time 
= o Sp{U), so this vertex is a p-distributed random variable independent of 
TP. We also let X2,X3,... be an independent q-sample, independent of T^,U. Let 
M = J(T^,Xi,i > 1) be the p-mapping associated to by the Joyal correspondence. 
We will prove Theorem ^i) by showing that the mapping- walk associated to M converges 
in law to Z^. 

Consider the slight variation of the process H_^{u): 

i^-p I \( \ — j MXfi'^)i^) if ^ is not a time when a vertex of the spine is visited 

^-'^''>^^> = \ Hl{u)is) + l else. 

This process thus "lifts" the heights of the vertices of the spine by 1. Recall from the proof 
of ^ Lemma 3] (with a slightly more general context that incorporates the weights w) 
that these vertices are visited precisely at the times for which the reversed pre-minimum 
process s t— > H^^_g-^_{u) jumps downward, so in K^{u) we just delay these jumps by 
the corresponding w-mass of the vertex. What we now call "excursion" or generalized 
excursion of above K]^{u) is just the same as before, that is a recentered portion of 
the path of on a flat interval of K^{u), with the convention that two excursions on 
two flat intervals with same heights (here and below the term "height" refers to the fiat 
intervals) are merged together as a single generalized excursion. By contrast with the 
above, these excursions may take negative values, but only at times when cyclic vertices 
are visited, where the excursions' value is —1. As above, let J"(ifP) be the process 
obtained by merging the excursions of above K^[u) in increasing order of height. A 
slight variation of [2 Lemma 3] gives 

Lemma 1 
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Notice in particular that is a functional of and Xi{U) alone, and does not depend 
on X2,X3, . . .. 

Proof of Theorem Let p„ satisfy Q with finite-length limit 6. We use Theo- 
rem 121 and Skorokhod's representation theorem, so we suppose that the convergence of 
(T(p„)ifP" (either in *-topology or Skorokhod topology according to the hypothe- 

ses) is almost-sure, as well as the convergence of , Sw,, , 5'q„ to the identity. We also 
suppose that the convergence of Theorem |21 (ii) is almost-sure. 

Fix e > 0. For (Lebesgue) almost-every m G [0, 1], m is not a local minimum of 
on the right or on the left. Fix such a u. Since u"'^ := S^^^ o S'p^(n) u a,s n ^ oo, 
it is easily checked that for any 77 > and n > Ni large enough, the processes H^ju""") 
and H^(u) (resp. (m'*^") and K^"^(u)) coincide outside the interval {u — rj^u + rj). Let 
£1, £2, • • • be the generalized excursions of above H_^{u), ranked by decreasing order of 
their durations li, I2, ■ ■ ., call hi, h2, . . . the corresponding (pairwise distinct) heights. Let 
a > be such that u{h) := sup^j^j.^.^^] — if.^||oo < e/3. Notice that for n > N2 large 

enough, we also have ujn{h) := a(p„) sup;,g[_„^„] WH^ii- + h) - ifP'^(-)||oo < Next, 
take k such that Yl'i=i — 1 — a/2, and choose r] < a/A such that none of the intervals 
of constancy of H^{u) corresponding to these k excursions intersect {u — r],u + rj). 

Next, consider hypothesis (i) of Theorem ^ If [a, h] is an interval of constancy of 
H^{ii"") (or Hf{u)) not intersecting (u — r],u + 1]), then there exists for n large enough 
a constancy interval of K^"^{u), which we denote by [a", 6"], such that (a", 6") {a,b), 
implying by Theorem |21^i) that 

(a(p„)(i/P:(a" + s)- Hllia-)), Q < s < - a^) ^ {H%a + s) - H%a), < s < 6 - a) 

uniformly. Moreover, for u as chosen above, if m G {ti,Ti) (notice m = Tj or m = is 
not possible) then there exists some tij,Tij with t.y < u < Tij. Thus, for such u and 
as a consequence of Theorem |21 (ii), if there exists a second such flat interval [c,d] with 
same height as the initial one (with say b < c), then there also exists a constancy interval 
[c", d"^] of K^"^{u) with (c", rf") (c, d), with the same height as the first one. Therefore, 
these two intervals do merge to form the interval of a generalized excursion of (T(p„)ifP^ 
above a{pn)K]^^{u) with length (6" — a") + (d" — c"), that converges uniformly to the 
generalized excursion of above H^{u) with height iff and duration {b — a) + {d — c). As 
a conclusion, one has — > Ei uniformly for every 1 < i < k, where is the generalized 
excursion of cr(p„)ifP^ above cr(p„)fCP'^(u'^") with i-th. largest duration Z". Call its 
height. 

Now (/i", . . . , /i^) (/ii, . . . , /tfc), and Z]i<i<fc 1^? ~ ^ as ?! ^ 00. Thus, if 
n > max(A''i, N2) is also chosen so that 

• Ei<.< J^r - y < «/2, 

• /i", . . . ,h1 are in the same order as /ii, . . . , (recall these are pairwise distinct), 

• supi<i<fc lie," -eilloo < e/2, 

then necessarily, the uniform distance between (j(p„)J""" (if^") and J"^""" {H^) is at most 
e. Indeed, for x G [0, 1], if x G ((7", c?") fl {gi, di) for some i < k, then 

|a(p„)J"^"(/iP:)(x)- J"^"(i7^)(x)| < ||£,-£ri|oo+ sup - e,{- + h)\\^ < e, 

\h\<a 
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and else the value taken by this difference does not exceed u{h) +Un{h) < e because there 
must be a zero of both processes at distance < a from x. Apply this to u = U, which 
a.s. does not belong to the set of local minima (on the left or on the right) of H^. Using 
Lemma H establishes the assertion of (i). 

The case (iv) of ^-convergence follows the same lines as in jTJ Lemma 2]. We suppose 
up to extracting subsequences that a{pn)H^" can be written as Qn + hn with gn converging 
uniformly to and hn{u) = ultimately for almost-every u. Then, up to modifying 
slightly the constancy intervals of K^", the same result as above holds for Qn, so this 
proves that (T(p„)J^(ifP") converges to in probability for the *-metric. 

Points (ii,iii) in Theorem then follow the same lines as in the proof of Theorem 
1]. We give some details for (ii). Let U2, U3, ... be uniform independent random variables, 
independent of H^, U, {m-.n > 1). Let U^" = ^"^ o S^^{Ui) for i > 2. Recall that the 
walk -ff*^" can be defined using only H^",U, so we are allowed to make the following 
choice for X2, X3, . . .: we let Xi be the vertex visited by -f^^l" time U^. Therefore, the 
marks D^^{i) are obtained recursively as follows: let v be the vertex visited by the first 

> D^^{i), then D^^{i + 1) is the first time when a cyclic point is visited strictly after 
V, i.e. at the right end of the generalized excursion of -ffwi" straddling this U^. Passing 
to the limit, we find that {D^^{i), I < i < j) converges a.s. to {D'^, 1 < ^ < j) defined 
recursively by: -D^+i is the first point of {di, d2, d^, . . .} that occurs after the first Uj > D[. 
It is easy to see that this defines a sequence with the same law as Dj, i>l. □ 

4 Inhomogeneous continuum random tree interpre- 
tation 

Let us briefly introduce the details of the limiting ICRT's stzcA;-6reaA;mg' construction [311]. 
Let 6 = {9q, 9i, 02, . . .) satisfy J2i>o ^1 — 1- Consider a Poisson process {Uj, Vj),j > 1 on 
the first octant O = {{x,y) G : < y < x}, with intensity 6q per unit area. For each 
i >1 consider also homogeneous Poisson processes > 1) with intensity 6i per unit 

length, and suppose these processes are independent, and independent of the first Poisson 
process. The points of M+ that are either equal to some Ui,i > 1 or some > 2 will 
be called outpoints. To a cutpoint rj we associate a joinpoint r]*: if 77 is of the form Ui, 
let rj* = Vi, while if rj = for some i > l,j > 2, we let t]* = Under the hypothesis 
J2i one shows that we may order the cutpoints as < ?7i < 772, • • • almost- surely. We 
build recursively a consistent family of trees whose edges are line-segments by first letting 
Tf be the segment [0,rii] rooted at 0, and then, given Tf, by attaching the left-end of 
the segment (?7j,?7j+i] at the corresponding joinpoint 77}, which has been already placed 
somewhere on Tf. Further, we relabel the joinpoints of the form 1 as i, and we relabel 
the leaves r]i,r]2,... as 2-|-, . . .. When all the branches are attached, we obtain a 
random metric space whose completion we call (it can therefore be interpreted as the 
completion of a special metrization of [0, 00)). We let [[fjUi]] be the only injective path 
from V to w, and jjfjw]] = \ {"^l- 

Together with the ICRT comes one natural measure, which is the length measure 
inherited from Lebesgue measure on [0, cx)). When 6 satisfies the further hypothesis 

> or ^ . 6i = 00, the tree can be endowed with another measure /x, which is a 
probability measure obtained as the weak limit of the empirical distribution fij on the 
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leaves 1+, 2+, . . . , J+ as J — > oo. We call /i the mass measure. 

If ^ G ©finite; it has been shown in 2j that is the exploration process of T^. To 
explain what this means, note first that induces a special pseudo-metric on [0, 1] by 
letting 

d{u,v) = H! + H!-2 \ni Hi 

w(^[u,v\ 

It turns out that the quotient space T obtained by identifying points of [0, 1] at distance 
has the same "law" as T^, where the mass measure is the measure on the quotient 
induced by Lebesgue measure on [0, 1]. Precisely, 

Theorem 3 (|[2J) //f/i, ■ ■ ■ ,Uj are independent uniform variables on [0, 1], independent 
of , then the subtree ofT^ spanned by the (equivalence classes of the) Ui 's has the same 
law as Tj . 

Conceptually, the stick-breaking construction provides an "algorithmic construction" of 
the ICRT, whereas the process plays a role similar to that of Brownian excursion in 
our methodology described in point (ii) in the introduction. 

We now show how some consequences of our main theorem can be formulated in terms 
of the stick-breaking construction of the ICRT. For v G T^, let junc(w) be the branchpoint 
between v and Define recursively a sequence = Cq, Ci, . . . of vertices of the spine 
[[root, H-]] with increasing heights recursively using the rule 

Given Cj let kj+i+ be the first leaf of {2+,3+,4+, . . .} with junc(A;j+i+) ^ 
[[root, Cj]] and let Cj+i = junc(/cj+i+). 

Corollary 2 Under regime ^ with limiting 6 G ©finite; 

(p„(i3,(M„)),a(p„)Card(C,(M0),J > 1) 

( lim — Card{l < i < k : junc(i+) g]]c,_i, c,]]}, ht(c,) — ht(c,„i), j > 1 J 

\k—>oo k J 

Proof. The n ^ oo limit of the left side is (by Corollary^ the law of 

{D,-D,_^,L'^^-L%_^,j>l). (11) 

By the description of /i as the — > oo limit of the empirical distribution on leaves 
2+, . . . , k+}, the k ^ oo limit of the right side of Corollary El becomes 

{fi{v G : junc(?;) e]]c,^uCj]]}M{cj) - ht(c,_i),j > 1). (12) 

So the issue is to show equality in law of (fTT|) and (fT^ . But Theorem El identifies the law 
(fT^ with the law 

iLeh{v G (0,1) : junc(t;) g]]c,_i, c,]]}, if,^. - j > 1) (13) 

where the quantities involved can be redefined as follows. Take Ui, U2, f/3, . . . uniform on 

(0, 1), independent of H^. Let junc(t') be the point at which inf[i; [/^j or inf[{/^ is 

attained. Given Cj, let c^+i = junc(f/') where U' is the first of {U2, U3, f/4, . . .} such that 
170 ^ zje 
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On the other hand, Di is by definition equal in law to the sum of the lengths of 
the generalized excursions of above H^{Ui) whose heights are less than or equal to 
that of the excursion containing an independent uniform U2-, while is the height of 
the corresponding excursion. Recursively, -Dj+i — Dj is equal in law to the sum of the 
durations of the excursions with heights between the height of the previously explored 
excursions (strictly) and the height of the excursion straddling the first Ui that falls in 
an excursion interval with height larger than the previous ones; — is then the 

difference of these heights. This identifies the law flllj) with the law (fT^ . □ 
Remark. Corollary |21 could alternatively be proved, for more general limit regimes, by an 
argument based directly on the Joyal correspondence, without using the detour through 
exploration processes. 

5 Final remarks 

The regimes (jHl) are basically the only possible ones, if we require a limit distribution for 
the number |C(M„)| of cyclic vertices. 

Lemma 2 //c„(|C(M„)| — dn) converges in law to some non-trivial distribution on R+ 
for some renormalizing sequences c, d, then there exists 6 such that p satisfies (0) up to 
elementary rescaling, that is, there exists a G (0, oo) and /? G M such that c„/(j(p„) a 
and Cndn (3. 

This lemma is a direct consequence of Theorem 4] and of Proposition |2l which 
implies that the number of cyclic points of a p-mapping has same distribution as one plus 
the distance from the root to a p-sampled vertex of a p-tree. 
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